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Abstract. Generalized majority-minority (GMM) operations are introduced as a com- 
mon generalization of near unanimity operations and Mal'tsev operations on finite sets. 
We show that every instance of the constraint satisfaction problem (CSP), where all con- 
straint relations are invariant under a (fixed) GMM operation, is solvable in polynomial 
time. This constitutes one of the largest tractable cases of the CSP. 



1. Introduction 

Constraint satisfaction problems arise in a wide variety of domains, such as combina- 
torics, logic, algebra, and artificial intelligence. An instance of the constraint satisfaction 
problem (CSP) consists of a set of variables, a set of values (which can be taken by the 
variables), called domain, and a set of constraints, where a constraint is a pair given by 
a list of variables, called scope, and a relation indicating the valid combinations of values 
for the variables in the scope; the goal is to decide whether or not there is an assignment 
of values to the variables satisfying all of the constraints. It is well known that the CSP 
admits several different but equivalent definitions. Feder and Vardi |19j formulated it as the 
problem of deciding whether there exists an homomorphism between two given relational 
structures. Also, an instance of the CSP can be viewed as a positive primitive sentence; the 
question is to decide whether or not the sentence is true. 

In its full generality the CSP is NP-complete. This fact motivates the project of iden- 
tifying restricted subclasses of the problem that are solvable in polynomial time. The 
most customary way to restrict the CSP is by fixing a set of relations T, generally called 
constraint language or basis and consider only instances of the CSP in which every re- 
lation appearing in a constraint belongs to T; this restricted version of the problem is 
generally denoted by CSP(r). Much effort has been devoted to the goal of isolating 
those constraint languages, T, for which its associated constraint satisfaction problem, 
CSP(r), is polynomial-time solvable. Despite the large amount of results in this direc- 
tion P0iniia0IHlinilII3IIIllIlliniinilIHlliniE21ESll21,a complete classification is still 
not known. 
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The other usual way to define subclasses of the general CSP is by restricting the possible 
scopes, not the relations, that can appear in a constraint. The state of affaires here is a 
way better: It has been proved [2U] that, under certain plausible assumptions, the tractable 
class identified in lS a is the only one, settling completely the question. 

Our goal in this paper is to introduce a general condition, such that every constraint 
language T satisfying this condition, leads to a subclass of the CSP, CSP(r), solvable in 
polynomial time. In order to place our result in context it will be necessary to include a 
short description of the cases of the CSP, known to be tractable. 

In our classification we shall distinguish between pure and hybrid tractable cases. Intu- 
itively, pure tractable cases are those that are explained by a simple and concrete combina- 
torial principle whereas hybrid tractable cases are those that can be reduced (not necessarily 
without a considerable degree of sophistication) to a combination of pure cases. Let us re- 
call here that this distinction is our rather personal attempt to classify the tractable cases 
of the CSP and does not pretend to be any claim about the "true nature" of the tractable 
cases. 

According to our view there are basically three pure maximal tractable cases: width 1, 
bounded strict width, and Mal'tsev problems. This classification corresponds to the three 
tractable families isolated in Feder and Vardi [101 : width 1, bounded strict width, and 
subgroup problems, in which the latter has been enlarged as to include all Mal'tsev problems, 
proven to be tractable recently by Bulatov [2] (see also [Jj). The vast majority of tractable 
cases of the CSP identified in the past (prior [HI,) fall in one of these three pure categories. 
As an illustrative example consider the six tractable cases in the boolean domain identified 
by Schaefer [22j, namely, 0- valid, 1-valid, Horn, dual-Horn, bijunctive and affine problems. 
The first four classes are particular instances of width 1 problems whereas the fifth and 
sixth class belong to bounded strict width and Mal'tsev respectively. 

In |10j . the pursuit of new tractable cases of the CSP took a new direction. The 
new class identified in ^01, the so-called paper-scissor-stone problems, could be regarded 
as constituted by an amalgam of Horn problems (and hence width 1 problems) and dual 
discriminator problems ^Hj; known to be particular instances of bounded strict width prob- 
lems. The algorithm devised in ^U] exploits the fact that the interaction between the width 
1 part and the bounded strict width part of the instance is very constrained. The class of 
paper-scissor-stone problems is the first hybrid tractable case. 

All tractable cases identified after ^U] with the notable exception of Mal'tsev problems 
are hybrid problems (the references ^ [U El [HI El constitute, up to the best of our 
knowledge, a complete list). Indeed, it is not daring to say that is very likely that much 
of the future progress in the study of the complexity of the CSP will come from a better 
understanding of the interaction between the different sources of pure tractability. 

A unifying framework for tractability of constraint satisfaction has been developed by 
Jeavons and coauthors in a sequence of papers culminating in [23]; the key theme of this 
framework is that it is generally possible to explain the tractability of a certain subclass 
of the CSP, CSP(r), by means of certain algebraic invariance properties of the relations in 
r. Consider, for instance, the class of bounded strict width problems: It is well known 
(see also [21]) that a constraint language T is bounded strict width if an only if there exists 
a near-unanimity operation ip, namely, an operation <p : A k — > A with k > 3 satisfying 

<p(x, y, y) = v(y, x, .., y) = ■ ■ ■ = tp(y, y, .., x) = y 
for all x, y G A, such that every relation in T is invariant under (p. 
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In a similar vein, a constraint language T is Mal'tsev if all its relations are invariant 
under a Mal'tsev operation, ie, an operation tp : A 3 — > A satisfying 

vixiViV) = v{y,V,x) = x, for all x,y G A. 

Similar characterizations, in terms of algebraic invariance properties, are known for the 
vast majority of tractable cases of the CSP. In fact, the connection between tractability of 
CSP and invariance properties is tighter, as it can be shown that the complexity of a given 
subclass of the CSP, CSP(r), depends only on the the set of operations under which T is 
invariant |21j . 

Generalized majority- minority operations first arose in the study of the learnability of 
relatively quantified generalized formulas fi . An operation tp : A k — > A with k > 3 is a 
generalized majority-minority ( GMM) operation if for all a, b G A, 

<p(x, y, -,y) = v(y, x, .., y) = ■ ■ ■ = tp(y, y, .., x)=y 

for all x, y G {a, b} 

or 

<p(x, y, y) = <p(y, y, -,x) = x for all x, y G {a, b}. 
GMM operations generalize both near-unanimity and Mal'tsev operations. Intuitively, 
an operation is GMM if for each 2-element subset of its domain acts either as a near- 
unanimity or as a Mal'tsev. In this paper we prove that every constraint language T in which 
all its relations are invariant under a GMM operation tp, gives rise to a subclass of the CSP 
solvable in polynomial-time. This new family of constraint satisfaction problems includes 
among other all bounded strict width and all Mal'tsev problems and, hence, constitutes 
one of the largest tractable classes of the CSP. Our algorithm exploits a feature which is 
already used -at least implicitely- in the known algorithms for Malt'sev and Near-unanimity 
problems: for any relation R invariant under a GMM operation tp it is always possible to 
obtain a "succint" representation, in the form of a relation G C R that generates R, i.e., 
such that R is the smallest relation invariant under tp containing G. Indeed, our algorithm 
for GMM problems can be viewed as a mixture of the algorithms for Near-unanimity and 
Malt'sev problems. However, it should be point out that the interaction between the two 
conditions is rather intricate. 

2. Preliminaires 

Let A be a finite set and let n be a positive integer. A ra-ary relation on A is any subset 
of A n . In what follows, for every positive integer n, [n] will denote the set {1, . . . ,n}. 

A constraint satisfaction problem is a natural way to express simultaneous requirements 
for values of variables. More precisely, 

Definition 2.1. An instance of a constraint satisfaction problem consists of: 

• a finite set of variables, V = {vi, . . . , v n }; 

• a finite domain of values, A; 

• a finite set of constraints {C\, . . . ,C m }; each constraint Q, I G [m], is a pair 
((%,. • .,v iki ),Si) where: 

— {yi x , . . . , Vi k ) is a tuple of variables of length k\, called the constraint scope and 

— Si is an A^-ary relation on A, called the contraint relation. 
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A solution to a constraint satisfaction instance is a mapping s : V — * A such that for 
each constraint Cj, I € [m], we have that (^(i;^), . . . , s^fe,)) belongs to Sj. Deciding whether 
or not a given problem instance has a solution is NP-complete in general, even when the 
constraints are restricted to binary constraints or the domain of the problem has size 
2 |12j . However by imposing restrictions on the constraint relations it is possible to obtain 
restricted versions of the problem that are tractable. 

Definition 2.2. For any set of relations T, CSP(T) is defined to be the class of decision 
problems with: 

• Instance: A constraint satisfaction problem instance V, in which all constraint re- 
lations are elements of T. 

• Question: Does V have a solution? 

In the last few years much effort has been devoted to the identification of those sets V 
for which CSP(r) is solvable in polynomial time (See [H El El El [HI El COl [TT1 [T^l CE1 ElZl 
CHlIiniESlESlEll). In order to isolate such "islands of tractability" it has been particulary 
useful to consider certain closure conditions on the relations of T. In order to make this 
more precise we need to introduce the following definition, which constitues the cornerstone 
of the so-called algebraic approach of the study of the CSP. 

Definition 2.3. Let ip : A k — > A be an fc-ary operation on A and let R be a n-ary relation 
over A. We say that R is invariant under ip if for all (not necessarily different) tuples 
ti = (*i> • • • >*n)> • • • >*k = • • • j *n) in -R, the tuple (p(t\, . . . , tk) defined as 

{ip(t\,...,tX),...,y{tl,...,t k n )) 

belongs to R. 

Given a relation R and an operation 93, we denote by (R)m the smallest relation S that 
contains R and that it is invariant under ip. Very often, the operation ip will be clear from 
the context and we will drop it writting (R) instead of (R)<p- 

Let ip : A k — > A be any operation on A. We denote by Inv(y?) the set containg all 
relations on A invariant under ip. 

The vast majority of constraint languages T such that CSP(r) is in PTIME can be 
expressed as Inv((p) for some operation ip. We refer the reader to the references pointed out 
in the introduction for a complete (up to the best of our knowledge) list of operations that 
lead to a tractable class of the CSP. In what follows we shall introduce only two families of 
operations, namely near-unanimity and Malt'sev, which will be particularly relevant to our 
work. 

Example 2.4. (Near-Unanimity Operations) An operation <p : A k — > A with k > 3 is 
near-unanimity (NU) if for all x, y € A, we have that 

<p{x, V, y) = <p(y, x, .., y) = ■ ■ ■ = <p{y, y, .., x) =y 

Tractability of CSP(Inv(</?)) for any arbitrary near-unanimity operation ip was proved in ^H] 
(See also (22]). 

Many well known tractable cases of the CSP, such as 2-SAT, or the family of CSP with 
implicative constraints |24| I13j are, in fact, particular instances of this general case. 

Example 2.5. (Mal'tsev Operations) An operation ip : A 3 — > A is Mal'tsev if for all 
x, y € A, we have 

f(x,y,y) = (p(y,y,x) = x 
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In [2] (see jjj for a simpler proof), it was shown that for every Malt'sev operation ip, 
CSP(Inv((£>)) is solvable in polynomial time. This general result encompasses some previ- 
ously known tractable cases of the CSP, such as CSP with constraints defined by a system 
of linear equations [23] or CSP with near-subgroups and its cosets [T§1 IT%] . 

The class of generalized majority-minority operations generalizes both near-unanimity 
and Mal'tsev operations. 

Definition 2.6. An operation cp : A k — > A with k > 3 is a generalized majority-minority 
( GMM) operation if for all a, b G A, either 

<p(x, y, y) = <p(y, x, .., y) = ■ ■ ■ = <p(y, y, .., x) =y ^ ^ 

for all x, y G {a, b} 

or 

V(x, y, -,y) = <p(y, V, -,x) = x for all x, y G {a, b} (2.2) 

Generalized majority-minority operations were introduced in the study of the learnabil- 
ity of relatively quantified generalized formulas [5]. 

Let us fix a GMM operation on a set A. A pair a, b G A is said to be a majority pair if 
ip on a, b satisfies (|2.1j) . It is said to be a minority pair if ip satisfies (|2.2j) . If a = 6 then we 
will say {a, 6} is a majority pair. 

In this paper we prove the following result: 

Theorem 2.7. For every GMM operation ip, CSP(Inv(</?)) is solvable in polynomial time. 
The proof is given in Section 0J 

3. Signatures and Representations 

Let A be a finite set, let n be a positive integer, let t = (t\, . . . ,t n ) be a n-ary tuple 
of elements in A, and let elements in [n]. By pr 4 ,.t we denote the tuple 

... , tj.). Similarly, for every n-ary relation R on A and for every i\, . . . ,ij € [n] we 
denote by pv ilt ^ R the j-ary relation given by {pr^ ^. t : t G ii}. Given a subset 
/ = {«!,..., i, } of [n] with ii < Z2 < • • • < ij we shall use pr 7 R to denote pr 4l ^ . R. 

Let n be a positive integer, let A be a finite set, let t, t' be n-ary tuples and let 
(i,a,b) be any element in [n] x A 2 with a 7^ b. We say that (t,t') witnesses (i,a,b) if 
P r i ... i-l t = P r i ... i-i t'j P^i t = a, and pr^ t' = b. We also say that t and t' witness (i, a, 6) 
meaning that (t,t ; ) witnesses (i,a,b). 

Let (p : A k — > A, k > 3, be a GMM operation and let R be any n-ary relation on A (not 
necessarily invariant under ip). We define the signature of ii relative to ip, Sig R C [n] x yl 2 , 
as the set containing all those (i, a, b) G [re] x t4 2 , with {a, 6} a minority pair witnessed by 
tuples in R, that is 

Sig R = {(i, a, 6) G [re] x A 2 : {a, 6} a minority pair , 
3t,t' G R such that (t,t') witnesses (i,a,b)} 

Fix a non- negative integer j. A subset R' of R is called a representation of i? relative to 
ip of order j if Sig^ = Sig R , and for every I C {1, . . . , n} with |I| < j, pr 7 i? = pr 7 ii'. We 
also say that R' is a j-representation of R relative to ip. Observe that for any n-ary relation 
there exists a j-representation with size bounded above by 2| Sig R \ + Y^ic[n] \i\<j I P r -f ^1- 
We call any such representation, a compact representation of R. Note: When the operation 
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p is clear from the context we shall drop the "relative to ip" with the implicit understanding 
that signatures and representations are relative to p. 

Example 3.1. Let A be a finite set, let ip : A k — > A, k > 3, be a GMM operation and let 
j and n positive integers. We shall construct a ^-representation R' of R = A n . 

Initially R' is empty. Fix any arbitrary element d in A. First, observe that Sig^. contains 
all (i, a, b) in [n] x A 2 where {a, b} is a minority pair. For each triple (i, a, b) in Sig R we add 
to R' two tuples t^, tL_, where is the tuple that has a in its ith coordinate and d elsewhere 
and, accordingly, is the tuple that has b in its ith coordinate and d elsewhere. Notice 
that (t^tLj witnesses (i,a,b). Hence, after adding to R' a corresponding pair t^,t^ for 
each (i, a, b) in Sig R we have that Sig^j = Sig^/. In a second step we add for each ii, . . . , iy 

with j' < j and %\ < i 2 < ■ ■ ■ < ij, and every a±, . . . , ay G A, the tuple t a 1 1 ' j ...' ) a J , that has a; 
in its iith coordinate for each I G [j'], and d elsewhere. It is easy to verify that we obtain 
a relation R! such that for all I C [n] with |/| < j, prjR' = A^ = pijR. Hence R' is a 
representation of R of order j. Observe that, indeed, R' is a compact representation of R. 

The algorithm we propose relies on the following lemma. 

Lemma 3.2. Let A be a finite set, let ip : A k — > A, k > 3 be a GMM operation, let R be a 
relation on A invariant under ip and let R' be a representation of R of order k — 1. Then 
(R') = R 

Proof. Let n be the arity of R. We shall show that for every i G [n], pr 1 ... j(-R') = Wi,...,i R 
by induction on i. The case i < k — 1 follows from pr : R' = V*i,...,k-i R- ^° ^ 
i > k and let a = (ai, . . . , a^) G pr 1 j i?. By induction hypothesis, for some 6j, the tuple 
a' = (ai, . . . , aj_i, 6j) belongs to pr 1 i{R')- In what follows we shall denote pr x i(R') as 
5. 

We consider two cases. 

Case 1. {ai,bi} is majority. 

In this case we show that, for every / C {l,...,i}, pr^a G pr^ S. We show it by 
induction on the cardinality m of /. The result is true for m < k — 1 due to the fact that 
R' is a (k — l)-representation of R. It is also true for every set I that does not contain 
i, since a' G S certifies it. Thus let I = {j±, . . . ,j m } be any set of indices 1 < j± < 32 < 
■ ■ ■ < jm = i with m > k and also let a* = pr^ a. To simplify the notation let us denote 
a* = (ci, . . . , c m ). By induction hypothesis, pr 7 S contains the tuples di = (d\,C2, ■ ■ ■ , c m ), 
d 2 = (ci,d 2 ,c 3 , . . . ,c m ), d m = (ci, . . . ,c m -i,d m ) for some di, . . . , d m G A. If for some 
i, Ci = di then we are done. Otherwise, we can assume that d m = b{ and c m = ai and 
henceforth {d m ,c m } is majority. If for some j, the pair {dj,Cj} is minority then we are 
done, because y(dj, dj, . . . , dj, d m ) = a*. Otherwise, {dj,Cj} is majority for any j. In this 
case we have <p(di, d 2 . . . , dk-i, dk) = a*. 

Case 2. {ai,bi} is minority. 

Since a and a' belong to pr x A R then there exists some tuples t,t' G R such that 
P r i i * = a an d P r i j = a '- Consequently, t and t' witness (i, ai, bi) and since S\g R i = 
Sig R we can conclude that pr 1 j i?' (and hence S) contains tuples c = [c\, . . . , Cj_i, a{) 
and c' = (ci, . . . , 6j) witnessing (z, a^, 6j). We shall show that a can be obtained from 
a', c and c' by applying operation ip. We need first an intermediate tuple; we define d = 
(di, . . . , di) as p(a', c' , . . . , c', c). Finally we obtain e = (ei, . . . , ej) as </?(a', a', . . . , a', d). 



GENERALIZED MAJORITY-MINORITY OPERATIONS ARE TRACTABLE 



V 



Let us see that the tuple obtained e is indeed a. For each I € [i — 1], if {ai, q} is a majority 
pair then di = <p(ai, q, . . . , q, q) = q, and consequently q = <p(ai, oj, . . . , aj, q) = a;. 
Otherwise, if {a/,q} is a minority pair, then = (p(ai, q, . . . , q) = a; and consequently 
q = y?(a;, o; . . . ,ai,af) = a\. Finally, let us look at the value of ej. Since {ai, bi} is a minority 
pair we have that di = </?(&«, bi, . . . , bi, ai) = ai and hence = </?(6j, bi, . . . , bi, ai) = ai and 
we are done. ■ 



4. Proof of Theorem I2T71 

We prove Theorem 12.71 by giving a polynomial-time algorithm that decides correctly 
whether a CSP(rnv(c/?)) instance has a solution. The structure of the algorithm mimics that 
of 0- 

Let V = {{v\, . . . ,v n }, A,{C\, . . . ,C m }) be a CSP(Inv(9j)) instance which will be the 
input of the algorithm. 

For each I € {0, . . . , m} we define Vi as the CSP instance that contains the first I 
constraints of V, that is Vi = ({«i, • • • , v n }, A, {C\, . . . , Cf\). Furthermore, we shall denote 
by Ri the n-ary relation on A defined as 

Ri = {(s(vi), . . . , s(v n )) : s is a solution of Vi) 

In a nutshell, the algorithm introduced in this section computes for each I E {0, . . . , m} 
a compact representation R[ of i?/. In the initial case (/ = 0), Vq does not have any 
constraint at all, and consequently, Rq = A n . Hence, a compact representation of R$ can 
be easily obtained as in Example 3. Once a compact representation R' Q of Ro has been 
obtained the algorithm starts an iterative process in which a compact representation R[ + i 
of is obtained from R[ and the constraint C;+i. This is achieved by means of a call to 
procedure Next, which constitutes the core of the algorithm. The algorithm then, goes as 
follows: 

Algorithm GMM(({ W1 , . . . ,v n ),A, {C\, ... , C rn })) 

Step 1 set R' as in Example 3 

Step 2 for each / £ {0, . . . , m — 1} do 

(let Ci+i be {{v h v iki+i ), S i+1 )) 
Step 2.1 set R' l+l := Next(^,ii, . . . ,i kl+1 ,Si +1 ) 

end for each 
Step 3 if R' m ^ return yes 
Step 4 otherwise return no 

Observe that if we modify step 3 so that the algorithm returns an arbitrary tuple in R' m 
instead of "yes" then we have an algorithm that does not merely solve the decision question 
but actually provides a solution. 

Correctness and polynomial time complexity of the algorithm are direct consequences 
of the correctness and the running time of the procedure Next: As it is shown in Section f4. 31 
(Lemma 14. lj) at each iteration of Step 2.1, the call Next(i?J,ii, . . . Si+i) correctly 
computes a compact representation of the relation {t € Ri : pr^,,^ t € Si + i} which is 
indeed Ri+i- Furthermore the cost of the call is polynomial in n, \A\, and |>Sz+i|, which 
gives as a total running time for the algorithm polynomial (see Corollary 14.21 for a rough 
approximation of the running time) in the size of the input. This finishes the proof of the 
correctness and time complexity of the algorithm, and hence, of Theorem 12.71 ■ 
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Let us remark that our emphasis here is on simplicity rather than in efficiency. In 
fact, much better time bounds that the ones provided in our analysis can be obtained by 
improving the code using data structures and by performing a more accurate analysis of 
the running time. 

The remainder of the paper is devoted to defining and analyzing procedure Next. In 
order to define procedure Next it is convenient to introduce previously a pair of procedures, 
namely Nonempty and Fix-values, which will be intensively used by our procedure Next. 

4.1. Procedure Nonempty. This procedure receives as input a fc-order compact represen- 
tation R' of a relation R invariant under <p, a sequence i±, . . . ,ij of elements in [n] where n 
is the arity of R, and a j'-ary relation S also invariant under <p. The output of the procedure 
is either an n-ary tuple t G R such that pr ix . ^. t G S or "no" meaning that such a tuple 
does not exist. 

Procedure Nonempty(i?', i\, . . . , ij, S) 
Step 1 set U := R' 

Step 2 while 3ti, t2, . . . , tk G U such that 

I"';. ; ¥>(ti,t 2 ,...,t k ) Pi',. ij' do 

Step 2.1 set U := U U {<p(ti,t 2 , . . . ,t k )} 
endwhile 

Step 3 if 3t in U such that pr il; ^ t G S then return t 
Step 4 else return "no" 

We shall start by studying its correctness. First observe that every tuple in U belongs 
initially to R' (and hence to R), or it has been obtained by applying ip to some tuples 
ti,t2, • • • ,tk that previously belong to U. Therefore, since R is invariant under <p, we can 
conclude that during all the execution of the procedure U C R. Consequently, if a tuple t 
is returned in step 3, then it belongs to R and also satisfies that pr^ ^. t G S, as desired. 
It only remains to show that if a "no" is returned in step 4 then there does not exist any 
tuple t in R such that pr ix . ^ t G S. In order to do this we need to show some simple facts 
about U. Notice that at any point of the execution of the procedure R' C U. Then U is 
also a representation of R and hence (U) = R. Therefore we have that 

(P r n,..,^ U) = pT iu _ ti .{U) = pr,. , R 

By the condition on the "while" of step 2 we have that when the procedure leaves the execu- 
tion of step 2 it is necessarily the case that for all ti, t 2 , . . . , tk G U, pr il; ^ <^(ti, t 2 , • • • , tk) G 
P r u,...,ij U an d consequently pr il; ^ U = {pT ilt i . U) = pr^ ^ R. Hence, if there exists 
some t in R such that pr^ ^ t G S then it must exist some t' in U such that pr ili ^ t' G S 
and we are done. 

Let us study now the running time of the procedure. It is only necessary to focus on 
steps 2 and 3. At each iteration of the loop in step 2, cardinality of U increases by one. So 
we can bound the number of iterations by the size \U\ of U at the end of the execution of 
the procedure. 

The cost of each iteration is basically dominated by the amount of computational 
time needed to check whether there exists some tuples 3ti, t 2 , . . . , tk G U such that 
P T h,...,ij y(ti,t 2 , . . . ,tk) P 1 *!,...,* - U in step 2. In order to try all possible combinations 
for ti,t 2 ,...,tk in U, \U\ k steps suffice. Each one of these steps requires time 0(\U\n), 
as tuples have arity n and checking whether </?(ti,t 2 , . . . ,tk) belongs to U can be done 
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naively by a sequential search in U. Thus, the total running time of step 2 is 0{\U\ k+1 n). 
The cost of step 3 is the cost of finding a tuple t in U satisfying pr iXi ^ t £ S which is 
0(|C/| |5|re). Putting all together we obtain that the complete running time of the proce- 
dure is 0{\U\ k+2 n + \U\\S\n) which we can bound by 0(|{7| + IS^n). So, it only remains to 
bound the size of U (at the end of the execution of the procedure). The size of U can be 
bounded by the initial size of R' which is at most 0(n|>l| 2 + n fc |^4| fc ) = 0((n |^4|) fc ) (since 
R' is compact) plus the number of iterations in step 2, which is bounded by | pr il(>>iji R\. 
Consequently the total running time of the procedure can be bounded by 

o(((n|A|)* : + |pr iW . J R|) fe+2 |5|nV 

We want to remark here that the size of pr^ ^ R can be exponentially large in the size 
of the input. For now we do not deal with this issue. Later we shall see how, in order to 
overcome this difficulty, we organize invoking to Nonempty in such a way that the value of 
P T ii,...,ij R is conveniently bounded. 

4.2. Procedure Fix-values. This procedure receives as input a compact representation 
R' of a relation R invariant under ip and a sequence a\, . . . , a m , m < n of elements of A (n 
is the arity of R). The output is a compact representation of the relation given by 

{t £ R : pr x t = ai, . . . ,pr m t = a m } 

Figure ^ contains a description of the procedure. Let us study its correctness. We 
shall show by induction on j £ {0, . . . , m} that Uj is a compact representation of Rj = 
{t £ R : pr x t = ai, . . . , pr^ t = a,j}. The case j = is correctly settled in step 1. Hence 
it is only necessary to show that at every iteration of the while loop in step 2, if Uj is a 
compact representation of Rj then Uj+i is a compact representation of Rj+i- We shall start 
by showing that at the end of the execution of step 2.2, Sig;j. +1 = Sig# . It is easy to 
see that if any of the conditions of the "if" in step 2.2.1 is falsified then (i,a,b) is not in 
Sigj^ . So it only remains to see that when the "if" in step 2.2.1 is satisfied, we have that 
(a) (ti,t5) witnesses (i, a, b), and (b) ti and ts are tuples in Rj+i, 

Proof of (a): We shall first show that for each I £ [i— 1] , pr ; ti = pr ; t$. Let q to be pr ; ti 
and let d\ = pr ; t2 = pr ; t3. If {q, d[} is a majority pair then pr ; = ip(ci,di, . . . ,di, di) = di 
and hence pr z ts = (p(ci, q, . . . , q, di) = q. Otherwise, if {ci,d[} is a minority pair, then 
pr z t 4 = ip(ci,di, ... , di,d t ) = Q and consequently pr z t 5 = <p(ci,ci, . . . , q, q) = q. So it only 
remains to show that pr^ ti = a and prj ts = b. We have pr^ ti = a as a direct consequence 
of the fact that ti is the tuple returned by the call to Nonempty(f7j , j + 1, i, {(aj+i, a)}). 
Observe also that as (tajta) witnesses (i,a,b) we have that pr^t2 = a and pr^ t3 = b. 
Consequently, since {a, b} is a minority pair we have that prj t 4 = ip(a, a, . . . ,a,b) = b and 
hence prj ts = cp(a, a, . . . , a, b) = b, as desired. 

Proof of (b): As ti is the output of the call Nonempty({7j, j + {(aj + ±, a)}), we can 
conclude that ti belongs to Rj, pr J+1 ti = etj+i, and prj ti = a. Consequently ti belongs to 
Rj+\. Furthermore, as ti, t2, and t3 are in Rj and Rj is invariant under 93, we can conclude 
that ts belongs to Rj. Thus in order to see that ts belongs to Rj+i it only remains to show 
that pr J+1 t5 = a,j+i. This can be obtained as a direct consequence of (a), since as (ti,ts) 
witnesses (i, a, b) and i > j + 1, we have that a^+i = pr,- +1 ti = pr J+1 ts. 

We have just seen that at the end of step 2.2, Sigjj = Sig# . In Step 2.3, procedure 
Fix-values enlarges Uj+i so that for every set I C [n] with |/| < k — 1, pr^ Uj+\ = 
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Procedure Fix-values (R', a±, . . . , a m ) 
Step 1 set j := 0; Uj := R' 

Step 2 while j < m do 

Step 2.1 set U j+1 := 

5iep 2.2 for each (i,a,b) G [n] x ^4 2 , {a, 6} is a minority pair, do 

Step 2.2.1 if Nonempty (£/,-, j + 1, i, {(a j+1 , a)}) /"no" and 

(i, a, b) G Sig f7j and i > j + 1 then 

(let ti be the tuple returned by Nonempty^-, j + 1, i, {(oj+i, a)}) 
and let t2,t3 be tuples in [Vj witnessing (z, a, b) ) 
Step 2.2.1.1 set t 4 := <p(ti,t 2 , . . . ,t 2 ,t 3 ) 

Step 2.2.1.2 set t 5 := ^(ti,ti, . . . ,ti,t 4 ) 

Step 2.2.1.3 set U j+1 := U j+1 U {ti,t 5 } 

end for each 
Step 2.3 for each k' G [k - 1] 

for each l±, . . . ,1^' £ N with l\ < I2 < ■ ■ ■ < lk> 
for each d\,... ,dk' G i do 
S'tep 2.3.1 if Nonewpty(Uj,li, ... ,lk>,j + l,{(di, ... ,dk>,aj + i)}) /"no" then 

(let t6 be the tuple returned by the call 
to Nonempty^-, Zi, . . .,l k ',3 + 1, {(^l, ■ • ■ , dh>, Oj+i)})) 
set f7j + i := t/j+i U {t 6 } 
end for each 
Step 2.4 set j := j + 1 

end while 
S'tep 5 return ?7 m 



Figure 1: Fix- values 

pijRj+i. The proof of this fact is rather straightforward. Let k', li,...,l k ', di,...,d k i 
be the running parameters of a given interation of step 2.3. It is easy to observe that 
the call to Nonempty([/j, h, . . . , l k ',j + 1, {(c?i, . . . ,d k i, Oj+i)}) is different than "no" if and 
only if (di, . . . , dy) G pr (l ; , Rj+i- Furthermore, if the call returns a tuple t6 then we can 
guarantee that t6 belongs to Rj+i and that pr 7 = (d±, . . . , dk>). We have just seen that at 
the end of the execution of step 2.3 for every set I C [n] with |/| < k—1, pr 7 Uj+i = pr 7 Rj+i. 

Notice that at each iteration of step 2.2, at most 2 tuples are added for each (i, a, b) in 
Sig^j . Furthermore at each iteration of step 2.3, at most one tuple is added per each k', 
I with \I\ < k — 1 and tuple in pr^ Rj+i- Consequently, Uj + i is compact. This completes 
the proof of its correctness. 

Let us study now its time complexity. The "while" loop at step 2 is performed m < n 
times. At each iteration the procedure executes two loops (Step 2.2 and Step 2.3). The 
"for each" loop at step 2.2 is executed for each (i, a, b) in [n] x A 2 with {a, b} a minority 
pair. That is a total number of times bounded by n|^4| 2 . The cost of each iteration of the 
loop in Step 2.2 is basically dominated by the cost of the call to procedure Nonempty which 
costs 0{((n\A\) k + \A\ 2 ) k+2 n) which is 0((n|A|) fc ( fc+2 ) +1 ). The "for each" loop at step 2.3 
is executed 0((n\A\) k ) times. The cost of each iteration is basically the cost of the call 
to Nonempty which is 0(((n\A\) k + l^^ra) = 0{{n\A\) k ( k+2 ^+ l ). Thus the total cost of 
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Procedure Next -bet a(i?', i\, . . . , ij, S) 

Step 1 set U := 

Step 2 for each (i,a,b) £ [n] x A 2 , {a, b} is a minority pair do 
Step 2.1 if Nonempty (R',ii, ... ,ij,i,S x {a}) /"no" then 

(let ti be Nonempty^', i\, . . . ,ij,i, S x {a})) 
Step 2.2 if Nonempty(Fix-values(i?', pr 1 ti, . . . , pr^ ti), i\, . . . S x {b}) 7^" no" 
(let t2 be Nonempty(Fix-values(i?', pr x t, . . . , pr^ t), i\, . . . , ij, i, S x {6})) 
set U := J7U{ti,t 2 } 
end for each 
Step 3 for each k' £ [k — 1] 

for each Zi, . . . , £ [n] with Zi < ?2 < • • • < h' 
for each d\ , . . . , dy £ A do 
^iep 3.i if Nonempty (R' , ii, . . . , ij, li, . . . , ly, S X {(d\, . . . , dk')}) 7^"no" then 
(let t 3 = Nonempty^', ii, . . . ,ij,l\, . . . ,h',S x {(rfi, . . .,d k >)})) 
set U := U U {t 3 } 
Step 4 return U 



Figure 2: Next-beta 

step 2.3 is 0((n\A\) k ( k+2 ^ +1+k ) . Thus the combined cost of steps 2.2 and 2.3 is dominated 
by the cost of 2.3 which gives as a total cost of 0((n\A\) k ( k+2 ^ +1+k ) for each iteration of 
step 2. Since step 2 is executed at most n times we have a total cost of the procedure of 
0{(n\A\) k{ - k+ V +1+k n) which we shall bound by 0{{n\A\)^ k+1 ^ k+2 ^). 

4.3. Procedure Next. We are now almost in a position to introduce procedure Next. 
Procedure Next receives as input a compact representation R' of a relation R invariant 
under (p, a sequence i\, . . . ,ij of elements in [n] where n is the arity of R, and a j-ary 
relation S invariant under (p. The output of Next is a compact representation of the relation 
R* = {t £ R : pr il; j. t £ S}. It is an easy exercise to verify that R* must also be invariant 
under ip. 

We shall start by defining a procedure, called Next -beta that although equivalent to 
Next has a worse running time. In particular, the running time of Next -beta might be 
exponential to the arity j of S (recall that arities can be unbounded as we allow infinite 
constraint languages). 

Figure |2] contains a description of the procedure. The overall structure of procedure 
Next -beta is similar to that of procedure Fix-values. The procedure constructs a rep- 
resentation U of R*. Initially U is empty. In step 2, Next-beta adds tuples to U so that 
when it leaves the execution of step 2, Sig^ = Sig R . Let us analyze step 2. Observe that 
the condition of the "if" statement 

Nonempty(i?', i\, . . . , ij, i, S x {a}) ^ "no" 

of step 2.1 is satisfied if and only if there exists a tuple ti £ R such that pr^ ^ ti £ S and 
prj ti = a. Hence if such a tuple does not exist then (i, o, b) is not in Sig^, and nothing 
needs to be done for (i,a,b). Now consider the condition of the "if" statement in step 2.2 
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which is given by 

Nonempty(Fix-values(i?', pr x ti, . . . , pr^ ti), 

ii,...,i J -,i,5x{6})^»no" 

This condition is satisfied if and only if there exists some t2 in R such that pr^ ^. t2 € 
"S 1 ) P r i i-i ^2 = P r i i-i ti and pr^ t2 = b. It is immediate to see that if the condition holds 
then t2 G R* and (ti, t2) witnesses (i, a, b). It only remains to show that if (i, a, b) G Sig R , 
then such a t2 must exist: Let t a ,tb be tuples in R* witnessing (i,a,b) and let ti be the 
tuple returned by the call to procedure Nonempty in step 2.1. In order to prove the existence 
of t2 we shall use the usual trick. First define a tuple u as </?(ti , t a , . . . , t a , tb) and finally let 
us define t2 as <^(ti,ti, . . . , ti,u). Since ti,t a ,tb belong to R* and R* is invariant under 
ip we can conclude that t-z belongs to R*. Let us show that pij j_ L t2 = p^ i-i^i' For 
each I £ [i — 1], let q be pr^ti and let di be pr^t a = pr^tb- If {ci,di} is a majority pair 
then pr ; u = <p(ci, di, . . . , di,d{) = d\ and hence pr ; t2 = (f(ci, q, . . . , q, d{) = q. Otherwise, 
{ci,di} is a minority pair and hence pr ; u = tp(ci, di, . . . , dj, dj) = q. Consequently, pr^ t2 = 
ip(ci,ci, . . . ,q,q) = q and we are done. Finally we need to see that pr^t2 = b. Observe 
that since {a, b} is a minority pair we have that pr f u = ip(a, a, . . . ,a,b) = b and hence 
prj t2 = ip(a, a,...,a,b) = b. 

We have just proved that, if U is the representation output by the procedure in Step 
4, then Sigjy = Sig K . . It is straightforward to verify that step 3 guarantees that for each I 
with |7| < k — 1, pr 7 U = pr 7 R*. The analysis here is basically identical to that of step 2.3 
in procedure Fix-values. Consequently, U is a representation of R*. 

At each iteration of step 2, at most 2 tuples are added for each (i,a,b) in Sig^j*. 
Furthermore at each iteration of step 3, at most one tuple is added per each k 1 , I, with 
\I\ < k — 1 and tuple in pr 7 Rj+i- Consequently, U is compact. This completes the proof 
of its correctness. 

Let us study the running time of procedure Next -beta. The loop of step 2 is performed 
n|A| 2 times and the cost of each iteration is basically the cost of steps 2.1 and 2.2 in which 
other procedures are called. The cost of calling Nonempty(i?', i\, . . . S x {a})} in step 
2.1 is 0(((n\A\) k + r) k+2 \S\n) where r is | pr^...^ R\. 

The cost of calling 

Nonempty(Fix-values(i2', pr 1 ti, . . . , pr^ ti), 
h, ...,ij,i,Sx {b}) 

in step 2.2 is the sum of the call to Fix-values which is 0({n\A\)( k+1 ^ k+2 ^) and the 
call to Nonempty which is 0(((n\A\) k + r) k+2 \S\n). Therefore, the total cost of an iteration 
of the loop of step 2 is 

O ^(n\A\) k + r y +2 \S\ n + (n\A\)( k+1 ^ k+2 ^ 

Hence, in order to obtain the total running time for the procedure we only need to multiply 
the previous quantify by the number of iterations, which is n|yl| 2 , obtaining 

n\A\ 2 ^{n\A\) k + r)* +2 \S\ n + (n|.4|)( fc+1 ^+ 2 ^ 

Let us take a closer look at the value of r = | pr^. ^. j R\. It is important to notice 
here that the set of possible constraints S that can appear in an instance is infinite and 
henceforth it is not possible to bound the value of j. Consequently, the value of r might 



GENERALIZED MAJORITY-MINORITY OPERATIONS ARE TRACTABLE 



13 



be exponential in the worst case. However, it would be possible to bound the value of j 
and get a polynomial bound for r if a finite subset T of Inv(y) is fixed beforehand and we 
assume that all constraint instances use only constraint relations from T. Such a situation 
is not completely unusual. In fact, a good number of results on the complexity of CSP(T) 
including the pioneering work of Schaeffer f2JJ] assumes T to be finite. By using the procedure 
Next -beta it could be possible to define a polynomial-time algorithm that solves CSP(T) 
for every finite subset T of Inv(y?). However we are aiming here for a more general result. 
To this end, we define a new procedure Next which makes a sequence of calls to Next-beta. 
Procedure Next(i?', i\, . . . ,ij,S) 
Step 1 set I := 0, U t := R' 
Step 2 while I < j do 

Step 2.1 set Ui + \ : = Next-beta(C/;, i\, . . . , U+i, pi\ i+i S) 
Step 2.2 set I := I + 1 

end while 
Step 3 return Uj 

Observe that at each call of the procedure Next -beta in step 2.1, the value of r can 
be bounded by | pr x t S\\A\, and hence the running time of each call to Next-beta can be 



Finally the running time of Next is obtained by multiplying by n (which always bounds 
j) the previous quantity. 

Lemma 4.1. For every n > 1, every n-ary relation R invariant under (p, every compact 
representation R' of R, every ii, . . . , ij E [n], and every j-ary relation S invariant under <p, 
Next(i? ii, . . . S) computes a compact representation of R* = {t G R : pr ilt ^ G S} in 
time O ((n| J 4|)( fc+1 ^ fc+2 ) +2 IS 1 ^" 1 " 3 ) . Furthermore R* is invariant under tp. ■ 

Finally, we have 

Corollary 4.2. Algorithm Solve decides correctly if an instance V of CSP(Inv(<^)) is 
satisfiable in time 0(m(n\A\)( k+1 ^ k+2,>+2 \S*\ k+3 ) where n is the number of variables of 
V ', m is its number of constraints and S* is the largest constraint relation occurring in V.M 
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